ANASTASIL: A Hybrid Knowledge-Based System for Document Layout Analysis
Authors
Abstract
This paper describes a knowledge-based system for identifying the different regions of a document image. It uses a hybrid, modular knowledge representation whose essential part is a so-called geometric tree. This tree is used to perform a best-first search in combination with a "hypothesize & test" strategy. The system produces an internal, editable description of the entire document and its constituents. It has been implemented in Common Lisp on a SUN 3/60 workstation for the analysis of single-sided business letters, and it runs on a large population of different business letters. The results obtained have been very encouraging and confirm the soundness of the approach.
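The abstract does not say how the geometric tree is stored or how hypotheses are scored, so the following is only a minimal sketch of the general idea, not the ANASTASIL implementation. It assumes that each tree node names a layout class and covers a rectangular page region, and that a hypothesis is "tested" by measuring how much of a block falls inside that region. The names `Node`, `overlap_score`, `classify`, `example_letter_tree`, the 0.5 threshold, and the example regions are all hypothetical.

```python
import heapq
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical geometric-tree node: a layout class and its page region."""
    label: str
    region: tuple  # (x0, y0, x1, y1) as fractions of the page size
    children: list = field(default_factory=list)


def overlap_score(region, block):
    """Test step: fraction of the block's area that lies inside the region."""
    x0, y0 = max(region[0], block[0]), max(region[1], block[1])
    x1, y1 = min(region[2], block[2]), min(region[3], block[3])
    inter = max(0.0, x1 - x0) * max(0.0, y1 - y0)
    area = (block[2] - block[0]) * (block[3] - block[1])
    return inter / area if area > 0 else 0.0


def classify(root, block, threshold=0.5):
    """Best-first search: always expand the highest-scoring hypothesis."""
    counter = 0  # tie-breaker so the heap never has to compare Node objects
    frontier = [(-overlap_score(root.region, block), counter, root)]
    while frontier:
        neg_score, _, node = heapq.heappop(frontier)
        if -neg_score < threshold:
            # The best remaining hypothesis fails the test, so all others do too.
            return None
        if not node.children:
            return node.label  # a leaf hypothesis passed the test
        for child in node.children:  # hypothesize: refine into sub-regions
            counter += 1
            score = overlap_score(child.region, block)
            heapq.heappush(frontier, (-score, counter, child))
    return None


def example_letter_tree():
    """Assumed regions for a business letter; purely illustrative values."""
    header = Node("header", (0.0, 0.0, 1.0, 0.3), [
        Node("sender", (0.0, 0.0, 0.5, 0.3)),
        Node("date", (0.5, 0.0, 1.0, 0.3)),
    ])
    body = Node("body", (0.0, 0.3, 1.0, 0.9))
    footer = Node("footer", (0.0, 0.9, 1.0, 1.0))
    return Node("letter", (0.0, 0.0, 1.0, 1.0), [header, body, footer])
```

Calling `classify(example_letter_tree(), (0.6, 0.05, 0.9, 0.1))` walks from the letter node through the header to the date leaf, because at each step the most promising hypothesis is popped from the priority queue first. A real system would presumably combine several kinds of evidence in the test step rather than a single overlap ratio.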
Related resources
A knowledge-based approach to the layout analysis
In this paper, we present a hybrid approach to the problem of document analysis in which the document image is segmented by means of a top-down technique and then basic blocks are grouped bottom-up in order to form complex layout components. In this latter process, called layout analysis, only generic knowledge on typesetting conventions is exploited. Such knowledge is independent of the parti...
A Hybrid Meta-heuristic for the Dynamic Layout Problem with Transportation System Design
This paper primarily presents a comprehensive dynamic layout design model which integrates layout and transportation system design via considering more realistic assumptions, such as taking account of fixed-position departments and distance between departments that endanger each other. In addition, specific criteria such as capacity, cost and reliability of facilities are considered in transpor...
Development of an Intelligent Cavity Layout Design System for Injection Molding Dies (RESEARCH NOTE)
This paper presents the development of an Intelligent Cavity Layout Design System (ICLDS) for multiple cavity injection moulds. The system is intended to assist mould designers in cavity layout design at concept design stage. The complexities and principles of cavity layout design as well as various dependencies in injection mould design are introduced. The knowledge in cavity layout design is ...
Document Structure Analysis Based on Layout and Textual Features
Document image processing is a crucial process in the office automation and begins from the ’OCR’ phase with difficulty of the document ’analysis’ and ’understanding’. This paper presents a hybrid and comprehensive approach to document structure analysis. Hybrid in the sense, that it makes use of layout (geometrical) as well as textual features of a given document. These features are the base f...
Project D.A.M.A.: Document Acquisition, Management and Archiving
A paper document processing system is an information system component which transforms information on printed or handwritten documents into a computer-revisable form. In intelligent systems for paper document processing this information capture process is based on knowledge of the specific layout and logical structures of the documents. In this project we design a framework which combines techn...